BIOTECHNOLOGY JftB ep-B 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 0^/^O2 f *? / 3 



Source: 
Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY 
EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT 
COMMUNICATION TO THE APPLICANT, WITH A NOTICE TO COMPLY 
or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, 
WITH A NOTICE TO COMPLY 

FOR FURTHER INFORMATION, PLEASE TELEPHONE MARK SPENCER, 
703-308-4212. 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM . ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 

Checker Version 3.0 

The Checker Version 3 .0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§ 1 .82 1 - 1 .825 effective October- 1 , 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO we bsite at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



^kaw Sequence Listing Err Jfcummary 



NUMBER: 



SERIAL 



ERROR DETECTED SUGGESTED CORRECTION 
ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 



Wrapped Nucleics The numberAext at the end of each line "wrapped" down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

Wrapped Aminos The amino acid numberAext at the end of each line "wrapped " down to the next line. 

This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "wrapping". 

Incorject Line Length The rules require that a line not exceed 72 characters in length. This includes spaces. 



Misaligned Amino Acid 
Numbering 

Non-ASCII 



Variable Length 



Patentln ver. 2.0 "bug" 



The numbering under each 5th amino acid is misaligned. This may be caused by the'use of tabs 
between the numbering. It is recommended to delete any tabs and use spacinofBetweerr the numbers. 

This file was not saved in ASCII (DOS) text, as required by the Sequence Rules. 

Please ensure your subsequent submission is saved in ASCII text so that it can be processed. 

Sequence(s) contain n's or Xaa*s which represented more than one residue. 

As per the rules, each n or Xaa can only represent a single residue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) feature section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequence(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section 
to the subsequent amino acid sequence. This applies primarily to the mandatory <220>-<223> 
sections for Artificial or Unknown sequences. 



Skipped Sequences 
(OLD RULES) 



Skipped Sequences 
(NEW RULES) 



Sequence(s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(i) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICS") 
(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence is intentionally skipped 

Please also adjust the "(iii) NUMBER OF SEQUENCES:" response to include the skipped sequence(s). 

Sequence(s) missing. If intentional, please use the following format for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 



10 \J Use of n's or Xaa's Use of n's and/or JCaa's have been detected in the Sequence Listing. 

(NEW RULES) Use of <220> to <223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 



11 



12 



13 



Use of <213>Organism 
(NEW RULES) 

Use of <220>Feature 
(NEW RULES) 



Patentln ver. 2.0 "bug" 



Sequence(s) . 



are missing this mandatory field or its response. 



Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <21 3>ORGANISIvT is "Artificial" or "Unknown" ^J'S 
Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 6/01/98, Vol. 63', No. 104, pp. 29631-32) (Sec. 1.823 of new Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted 

file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). 

Instead, please use "File Manager or any other means to copy file to floppy disk. 



AKS-Biotechnology Systems Branch- 5/15/99 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/402,713 



DATE : 08/29/2000 
TIME: 16:50:53 



Input Set : A:\seqlist-1619 0020001.txt 
Output Set: N:\CRF3\08292000\I402713.raw 



and Methods of Use 




4 <110> APPLICANT: Bussemakers , Marion J.G. 
6 <120> TITLE OF INVENTION : PCA3, PCA3 Genes 
8 <130> FILE REFERENCE: 1619.0020001 

10 <140> CURRENT APPLICATION NUMBER : US 09/402,713 

11 <141> CURRENT FILING DATE: 2000-06-13 

13 <150> PRIOR APPLICATION NUMBER: PCT/CA98/0034 6 

14 <151> PRIOR FILING DATE: 1998-04-09 

16 <150> PRIOR APPLICATION NUMBER: US 60/041,836 

17 <151> PRIOR FILING DATE: 1997-04-10 
19 <160> NUMBER OF SEQ ID NOS : 8 

21 <170> SOFTWARE: Patentln Ver . 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 2037 

25 <212> TYPE: DNA 

2 6 <213> ORGANISM: Unknown Organism 

28 <220> FEATURE: 

29 <221> NAME/KEY: CDS 

30 <222> LOCATION: (379).. (531) 
32 <220> FEATURE: 

3 3 <223> OTHER INFORMATION: Description of Unknown Organism: PCA3 

3 5 <4 00> SEQUENCE: 1 

36 agaagctggc atcagaaaaa cagaggggag atttgtgtgg ctgcagccga gggagaccag 60 
38 gaagatctgc atggtgggaa ggacctgatg atacagagga attacaacac atatacttag 120 
40 tgtttcaatg aacaccaaga taaataagtg aagagctagt ccgctgtgag tctcctcagt 180 
42 gacacagggc tggatcacca tcgacggcac tttctgagta ctcagtgcag caaagaaaga 240 
44 ctacagacat ctcaatggca ggggtgagaa ataagaaagg ctgctgactt taccatctga 300 
46 ggccacacat ctgctgaaat ggagataatt aacatcacta gaaacagcaa gatgacaata 360 

4 8 taatgtctaa gtagtgac atg ttt ttg cac att tec age ccc ttt aaa tat 411 
4 9 Met Phe Leu His lie Ser Ser Pro Phe Lys Tyr 

50 1 5 10 

53 cca cac aca cag gaa gca caa aag gaa gca cag aga tec ctg gga gaa 4 59 

54 Pro His Thr Gin Glu Ala Gin Lys Glu Ala Gin Arg Ser Leu Gly Glu 

55 15 20 25 

57 atg ccc ggc cgc cat ctt ggg tea teg atg age etc gee ctg tgc ctg 507 

58 Met Pro Gly Arg His Leu Gly Ser Ser Met Ser Leu Ala Leu Cys Leu 

59 30 35 40 

61 gtc ccg ctt gtg agg gaa' gga cat tagaaaatga attgatgtgt tccttaaagg 561 

62 Val Pro Leu Val Arg Glu Gly His 

63 45 50 

65 atgggcagga aaacagatcc tgttgtggat atttatttga aegggattae agatttgaaa 621 
67 tgaagtcaca aagtgagcat taccaatgag aggaaaacag acgagaaaat cttgatggct 681 
69 tcacaagaca tgcaacaaac aaaatggaat actgtgatga catgaggcag ccaagctggg 741 
71 gaggagataa ccacggggca gagggtcagg attctggccc tgctgcctaa actgtgcgtt 801 
73 cataaccaaa tcatttcata tttctaaccc tcaaaacaaa gctgttgtaa tatctgatct 861 
75 ctacggttcc ttctgggccc aacattctcc atatatccag ccacactcat ttttaatatt 921 
77 tagttcccag atctgtactg tgacctttct acactgtaga ataacattac tcattttgtt 981 
79 caaagaccct tcgtgttgct gectaata'tg tagctgactg tttttcctaa ggagtgttct 1041 



Does Not Comply 
Corrected Diskette Needed 
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RAW SEQUENCE LISTING DATE: 08/29/2000 

PATENT APPLICATION : US/09/402 , 713 TIME: 16:50:53 

Input Set : A:\seqlist-1619 0020001.txt 
Output Set: N:\CRF3\08292000\I402713.raw 



81 
83 
85 
87 
89 
91 
93 

W--> 95 
W--> 97 

99 
101 
103 
105 
107 
109 
111 
113 
116 
117 
118 
119 
W--> 120 
120 
122 
123 
124 
126 
127 
129 
130 
132 
133 
137 
138 
139 
140 
142 
143 
145 
146 
147 
148 
149 
150 
151 
152 
153 
154 
155 



ggcccagggg atctgtgaac 
actagcacac agcatgatca 
tttgcccata ctgaaattca 
attccattaa tatcacagga 
gcagctatgg gaatttaatt 
atccctcccc tttgtttgat 
gaggctgtat Acagcacagc 
cccctcccatfnVsacctaaa 
cattrttcct *tctgcctgag 
tttgaataag ttgactatct 
acatgagaca gcaaatacta 
aatgcaagak ccacagaggg 
aaaggcaggg aacctcatag 
atatccaaca agcttttcac 
catttcatgg tgagtgcgct 
ttgagatgtg tttgtccttg 
tagggttcac tcctggcaat 
<210> SEQ ID NO: 2 
<211> LENGTH: 51 
<212> TYPE: PRT 

<213> ORGANISM: Unknown Organism 

<220> FEATURE: ^ y - ■ ■ ~ 

<223> OTHER INFORMATION: Description of Unknown Organism: PCA3 
<40.0> SEQUENCE: 2 

Met Phe Leu His lie Ser Ser Pro Phe Lys Tyr Pro His Thr Gin Glu 
5 10 15 



aggctgggaa gcatctcaag atctttccag ggttatactt 1101 
ttacggagtg aattatctaa tcaacatcat cctcagtgtc 1161 
tttcccactt ttgtgcccat tctcaagacc tcaaaatgtc 1221 
ttaacttttt tttttaacct ggaagaattc aatgttacat 1281 
acatattttg ttttccagtg caaagatgac taagtccttt 1341 
tttttttcca gtataaagtt aaaatgctta gccttgtact 1401 
ctctccccat ccctccagcc ttatctgtca tcacca±caa 14 61 
caaaatctaa cttgtaattc cttgaacatg tcaggffJbata 1521 
aagctcttcc ttgtctctta aiytctagaat: gatgtaaagt 1581 
tacttcatgc aaagaaggga cacatatgag attcatcatc 1641 
aaagtgtaat ttgattataa gagtttagat aaatatatga 1701 
aatgtttatg gggcacgttt gtaagcctgg gatgtgaagm 1761 
tatcttatat aatatacttc atttctctat ctctatcaca 1821 
agaattcatg cagtgcaaat ccccaaaggt aacctttatc 1881 
ttagaatttt ggcaaatcat actggtcact tatctcaact 1941 
tagttaattg aaagaaatag ggcactcttg tgagccactt 2001 
aaagaattta caaaga 2037 



Ala Gin Lys Glu Ala Gin Arg Ser Leu Gly Glu Met Pro Gly Arg His 

"1 (\ in 



ion of Unknown Organism: PCA3 



Leu Gly Ser Ser Met Ser Leu Ala Leu Cys Leu Val Pro Leu Val Arg 

35 40 45 

Glu Gly His 
50 

<210> SEQ ID NO: 3 
<211> LENGTH: 1872 
<212> TYPE: DNA 

<213> ORGANISM: Unknown Organism 
<220> FEATURE: 

<223> OTHER INFORMATION: Descript 
<400> SEQUENCE: 3 • 
agaagctggc atcagaaaaa cagaggggag 
gaagatctgc atggtgggaa ggacctgatg 
gactttacca tctgaggcca cacatctgct 
agcaagatga caatataatg tctaagtagt 
aaatatccac acacacagga agcacaaaag 
ggccgccatc ttgggtcatc gatgagcctc 
ggacattaga aaatgaattg atgtgttcct 
gtggatattt atttgaacgg gattacagat 
aatgagagga aaacagacga gaaaatcttg 
tggaatactg tgatgacatg aggcagccaa 



atttgtgtgg 
atacagaggt 
gaaatggaga 
gacatgtttt 
gaagcacaga 
gccctgtgcc 
taaaggatgg 
ttgaaatgaa 
atggcttcac 
gctggggagg 



ctgcagccga 
gagaaataag 
taattaacat 
tgcacatttc 
gatccctggg 
tggtcccgct 
gcaggaaaac 
gtcacaaagt 
aagacatgca 
agataaccac 



gggagaccag 60 
aaaggctgct 120 
cactagaaac 180 
cagccccttt 240 
agaaatgccc 300 
tgtgagggaa 360 
agatcctgtt 420 
gagcattacc 480 
acaaacaaaa 540 
ggggcagagg 600 
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RAW SEQUENCE LISTING DATE: 08/29/2000 

PATENT APPLICATION: US/09/402,713 TIME: 16:50:53 

Input Set : A:\seqlist-1619 0020001.txt 
Output Set: N:\CRF3\08292000\l402713.raw 



156 
157 
158 
159 
160 
161 
162 
163 
164 
165 
166 
W--> 167 
W--> 168 
W--> 169 
170 
171 
172 
173 
174 
175 
176 
177 
180 
181 
182 
183 
185 
186 
188 
189 
190 
191 
192 
193 
194 
195 
196 
197 
198 
199 
200 
201 
202 
205 
206 
207 
208 
210 
211 



gtcaggattc 
taaccctcaa 
ttctccatat 
ctttctacac 
aatatgtagc 
tgggaagcat 
ggagtgaatt 
ccacttttgt 
cttttttttt 



tggccctgct 
aacaaagctg 
atccagccac 
tgtagaataa 
tgactgtttt 
ctcaagatct 
atctaatcaa 
gcccattctc 
taacctggaa 



attttgtttt ccagtgcaaa 
tttccagtat aaagttaaaa 



gcctaaactg tgcgttcata accaaatcat ttcatatttc 
ttgtaatatc tgatctctac ggttccttct gggcccaaca 
actcattttt aatatttagt tcccagatct gtactgtgac 
cattactcat tttgttcaaa gacccttcgt gttgctgcct 
tcctaaggag tgttctggcc caggggatct gtgaacaggc 
ttccagggtt atacttacta gcacacagca tgatcattac 
catcatcctc agtgtctttg cccatactga aattcatttc 
aagacctcaa aatgtcattc cattaatatc 
gaattcaatg ttacatgcag ctatgggaat 
gatgactaag tcctttatcc ctcccctttg 
tgcttagcct tgtactgagg ctgtatacag 



ccccatccct ccagccttat ctgtcatcac catcaacccc tcccatnysa 



atctaacttg taattccttg 
tcttccttgt ctcttaantc 

tcatgcaaag aagggacaca 
tgtaatttga ttataagagt 
tttatggggc acgtttgtaa 
ttatataata tacttcattt 



<211> 
<212> 
<213> 
<220> 
<223> 



aacatgtcag gncatacatt rttccttctg 
tagaatgatg taaagttttg aataagttga 

tatgagattc atcatcacat gagacagcaa 
ttagataaat atatgaaatg caagakccac 
gcctgggatg tgaagmaaag gcagggaacc 
ctctatctct atcacaatat ccaacaagct 
ttcatgcagt gcaaatcccc aaaggtaacc tttatccatt tcatggtgag 
aattttggca aatcatactg gtcacttatc tcaactttga gatgtgtttg 
taattgaaag aaatagggca ctcttgtgag ccactttagg gttcactcct 
aatttacaaa ga 
<210> SEQ ID NO: 4 
LENGTH: 820 
TYPE: DNA 

ORGANISM: Unknown Organism 
FEATURE: 

OTHER INFORMATION: Description of Unknown Organism: PCA3 
<4 00> SEQUENCE: 4 

agaagctggc atcagaaaaa cagaggggag atttgtgtgg ctgcagccga gggagaccag 



acaggattaa 
ttaattacat 
tttgattttt 
cacagcctct 
cctaaacaaa 
cctgagaagc 
ctatcttact 
atactaaaag 
agagggaatg 
tcatagtatc 
tttcacagaa 
tgcgctttag 
tccttgtagt 
ggcaataaag 



660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1260 



gaagatctgc 
gactttacca 
agcaagatga 
aaatatccac 
ggccgccatc 
ggacattaga 
gtggatattt 
aatgagagga 
tggaatactg 
gtcaggattc 
taaccctcaa 
ttctccatat 
ctttctacac 
<210> SEQ 
<211> 
<212> 
<213> 
<220> 
<223> 



atggtgggaa ggacctgatg atacagaggt gagaaataag aaaggctgct 



tctgaggcca cacatctgct gaaatggaga 
caatataatg tctaagtagt gacatgtttt 
acacacagga agcacaaaag gaagcacaga 
gatgagcctc gccctgtgcc 
atgtgttcct taaaggatgg 
gattacagat ttgaaatgaa 
gaaaatcttg atggcttcac 
aggcagccaa gctggggagg 
gcctaaactg tgcgttcata 
ttgtaatatc tgatctctac 
actcattttt aatatttagt 
cattactcat tttgttcaaa 



ttgggtcatc 
aaatgaattg 
atttgaacgg 
aaacagacga 
tgatgacatg 
tggccctgct 
aacaaagctg 
atccagccac 
tgtagaataa 
ID NO: 5 
LENGTH: 11 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION : Description of Artificial Sequence 



taattaacat cactagaaac 
tgcacatttc cagccccttt 
gatccctggg agaaatgccc 
tggtcccgct tgtgagggaa 
gcaggaaaac agatcctgtt 
gtcacaaagt gagcattacc 
aagacatgca acaaacaaaa 
agataaccac ggggcagagg 
accaaatcat ttcatatttc 
ggttccttct gggcccaaca 
tcccagatct gtactgtgac 



1500 
1560 
1620 
1680 
1740 
1800 
1860 
1872 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

820 



antigenic 
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RAW SEQUENCE LISTING DATE: 08/29/2000 

PATENT APPLICATION: US/09/402,713 TIME: 16:50:53 

Input Set : A:\seqlist-1619 0020001.txt 
Output Set: N:\CRF3\08292000\l402713.raw 

212 peptide 

214 <4 00> SEQUENCE: 5 

215 His Thr Gin Glu Ala Gin Lys Glu Ala Gin Arg 

216 15 10 

219 <210> SEQ ID NO: 6 

220 <211> LENGTH: 3582 

221 <212> TYPE: DNA 

222 <213> ORGANISM: Unknown Organism 
224 <220> FEATURE: 

22 5 <221> NAME/KEY: CDS 

226 <222> LOCATION: (401).. (553) 

228 <220> FEATURE: 

229 <223> OTHER INFORMATION: Description of Unknown Organism: PCA3 

231 <400> SEQUENCE: 6 

232 acagaagaaa tagcaagtgc cgagaagctg gcatcagaaa aacagagggg agatttgtgt 60 
2 34 ggctgcagcc gagggagacc aggaagatct gcatggtggg aaggacctga tgatacagag 120 
236 gaattacaac acatatactt agtgtttcaa tgaacaccaa gataaataag tgaagagcta 180 
2 38 gtccgctgtg agtctcctca gtgacacagg gctggatcac catcgacggc actttctgag 240 
240 tactcagtgc agcaaagaaa gactacagac atctcaatgg caggggtgag aaataagaaa 300 
242 ggctgctgac tttaccatct gaggccacac atctgctgaa atggagataa ttaacatcac 360 

244 tagaaacagc aagatgacaa tataatgtct aagtagtgac atg ttt ttg cac att 415 

245 Met Phe Leu His lie 

246 1 5 

248 tec age ccc ttt aaa tat cca cac aca cag gaa gca caa aag gaa gca 463 

249 Ser Ser Pro Phe Lys Tyr Pro His Thr Gin Glu Ala Gin Lys Glu Ala 

250 10 15 20 

252 cag aga tec ctg gga gaa atg ccc ggc cgc cat ctt ggg tea teg atg 511 

2 53 Gin Arg Ser Leu Gly Glu Met Pro Gly Arg His Leu Gly Ser Ser Met 

254 25 30 35 

2 56 age etc gee ctg tgc ctg gtc ccg ctt gtg agg gaa gga cat 553 

2 57 Ser Leu Ala Leu Cys Leu Val Pro Leu Val Arg Glu Gly His 

258 40 45 50 

260 tagaaaatga attgatgtgt tccttaaagg atgggcagga aaacagatcc tgttgtggat 613 

262 atttatttga aegggattae agatttgaaa tgaagtcaca aagtgagcat taccaatgag 673 

264 aggaaaacag acgagaaaat cttgatggct tcacaagaca tgcaacaaac aaaatggaat 733 

266 actgtgatga catgaggcag ccaagctggg gaggagataa ccacggggca gagggtcagg 793 

268 attctggccc tgctgcctaa actgtgcgtt cataaccaaa tcatttcata tttctaaccc 853 

270 tcaaaacaaa gctgttgtaa tatctgatct ctacggttcc ttctgggccc aacattctcc 913 

272 atatatccag ccacactcat ttttaatatt tagttcccag atctgtactg tgacctttct 973 

274 acactgtaga ataacattac tcattttgtt caaagaccct tcgtgttgct gectaatatg 1033 

276 tagctgactg tttttcctaa ggagtgttct ggcccagggg atctgtgaac aggctgggaa • 1093 

278 gcatctcaag atctttccag ggttatactt actagcacac agcatgatca ttacggagtg 1153 

280 aattatctaa tcaacatcat cctcagtgtc tttgeccata ctgaaattca tttcccactt 1213 

282 ttgtgcccat tctcaagacc tcaaaatgtc attccattaa tatcacagga ttaacttttt 1273 

284 tttttaacct ggaagaattc aatgttacat gcagctatgg gaatttaatt acatattttg 1333 

286 ttttccagtg caaagatgac taagtccttt atccctcccc tttgtttgat tttttttcca 1393 

288 gtataaagtt aaaatgetta gecttgtact gaggctgtat acagcacagc ctctccccat 1453 

290 ccctccagcc ttatctgtca tcaccatcaa cccctcccat accacctaaa caaaatctaa 1513 

292 cttgtaattc cttgaacatg tcaggacata cattattcct tetgectgag aagctcttcc 1573 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/402,713 



DATE : 08/29/2000 
TIME: 16:50:53 



294 
296 
298 
300 
302 
304 
306 
308 
310 
312 
314 
316 
318 
320 
322 
324 
326 
328 
330 
332 
334 
336 
338 
340 
342 
344 
346 
348 
350 



356 
358 
360 
363 
364 
365 
366 



Input Set : 
Output Set: 



A:\seqlist-1619 0020001.txt 
N:\CRF3\08292000\I402713.raw 



ttgtctctta 
aaagaaggga 
ttgattataa 
gggcacgttt 
aatatacttc 
cagtgcaaat 



ccccaaaggt 



aatctagaat gatgtaaagt tttgaataag ttgactatct 
cacatatgag attcatcatc acatgagaca gcaaatacta 
gagtttagat aaatatatga aatgcaagag ccacagaggg 
gtaagcctgg gatgtgaagc aaaggcaggg 
atttctctat ctctatcaca atatccaaca 
aacctttatc catttcatgg 
tatctcaact ttgagatgtg 
tgagccactt tagggttcac 
gttgttaaga gctctgtgtg 
tctctctctt gacccattat 
gttttcaaat ggcactatga gctgccaatg atgtatcacc 
agtaaatgtg ataataatgt catctgttaa cataaaaaaa 
ctggaaatgg acaaccacaa tatgcataaa tctaactcct accatcagct acacactgct 
tgacatatat tgttagaagc acctcgcatt tgtgggttct cttaagcaaa atacttgcat 
ctggggctgt gcatcaggcg gtttgagaaa tattcaattc tcagcagaag 
aattccctca tcttttagga atcatttacc aggtttggag 
gctttcacta atgtctctga acttctgtcc ctctttgtgt 
aatgttatct ttgaactgat gctcatagga gagaatataa 
attagggatt caaagaaata ttagatttaa gctcacactg 
aaagaactct gagctgtcat cgtccccatc 



ggcaaatcat actggtcact 
aaagaaatag ggcactcttg 
caaagagcta ctcaggacca 
tgtacatgcc aaagtgtgcc 



aacctcatag 
agcttttcac 
tgagtgcgct 
tttgtccttg 
tcctggcaat 
tgtgtgtgtg 
ttcagactta 
accatatctc 
gtttgacttc 



tacttcatgc 
aaagtgtaat 
aatgtttatg 
tatcttatat 
agaattcatg 
ttagaatttt 
tagttaattg 
aaagaattta 
tgtgtgtgag 
aaacaagcat 
attattctcc 
acaaaagcag 



taggtctcag 
ccagaatttg 
cagctcaggt 
tccaataaat 
tgatatcaac 
accaagatac 
gcaggaccca 
tctcctatta 
ttctagcctt 
ctctgctctg 
ccatccccac 
ctgcctatgg 
aaatccaact 
attgactgaa 
tggatcatgc 



acgcatgtct gagatcctta 
tggatgctag cttctggcca 



aatcaaggaa 
tctctggctc 
tttctccaac 
agaatttcaa 
acccttattt 
acattagata 



aggattcaga 
tcatggatag 
gaactctgag 
gtcaaaagga 
acaaccaaca 
tgagttgaat 
acatattagc 



tctgtgagcc 
accagtgtca 
tcctcttgac 

acatcgctta ccaatcctct 

cgactctcaa gtcttttctt 

ttattaattt ccaatagatg 

tttaaagtct aagaggttca 

acctcccctg ctcctctccc tatattactg 

cagtcaaatg agaaacccag tggctccttg 



tgcttccacg acttttatct 
ttgctttgga cttccccaca 
cactaacctg aattgcctag 
gctaatattg ctttagatga 
cattatcttc tctttctttc 
caggatggtc cccaagatgc 

atgcaagact gctgaagcca gaggatgact gattacgcct catgggtgga 
352 ggggaccact cctgggcctt cgtgattgtc aggagcaaga cctgagatgc tccctgcctt 
3 54 cagtgtcctc tgcatctccc ctttctaatg aagatccata gaatttgcta catttgagaa 
ttccaattag gaactcacat gttttatctg ccctatcaat tttttaaact tgctgaaaat 
taagtttttt caaaatctgt ccttgtaaat tactttttct tacagtgtct tggcatacta 
tatcaacttt gattctttgt tacaacttt 
<210> SEQ ID NO: 7 
<211> LENGTH : 51 

<212> TYPE: PRT **• 
PP'VflNT^H ■ JJnknown Organism 



1633 
1693 
1753 
1813 
1873 
1933 
1993 
2053 
2113 
2173 
2233 
2293 
2353 
2413 
2473 
2533 
2593 
2653 
2713 
2773 
2833 
2893 
2953 
3013 
3073 
3133 
3193 
3253 
3313 
3373 
3433 
3493 
3553 
3582 



36 
367 
369 
370 
371 
373 
374 
376 
377 
379 
380 



7C<220> 



INFDRj 



> PEATUR 
<223> OTHER 
<400> SEQUENCE: 
Met Phe Leu His 



Ala Gin Lys 



Leu Gly 

Glu Gly 
50 



Ser 
35 
His 



Glu 
20 
Ser 



RMATION: 
7 

He Ser 
5 

Ala Gin 



Met Ser Leu 



Description of Unknown Organism: PCA3 

Ser Pro Phe Lys Tyr Pro His Thr Gin Glu 

10 15 

Arg Ser Leu Gly Glu Met Pro Gly Arg His 

25 30 

Ala Leu Cys Leu Val Pro Leu Val Arg 
40 45 
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VERIFICATION SUMMARY 
PATENT APPLICATION: 



US/09/402, 713 



DATE: 08/29/2000 
TIME: 16:50:54 



Input Set : A:\seqlist-1619 0020001.txt 
Output Set: N:\CRF3\08292000\I402713.raw 



L:ll M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:95 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
L:97 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 
L:120 M:258 W: Mandatory Feature missing, <220> FEATURE : 
L:16-7 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:3 
L:167 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:3 
L:167 M:340 W: (46) "n" or "Xaa" used: Feature required, for SEQ ID# : 3 
L:168 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID# : 3 
L:168 M:258 W: Mandatory Feature missing, <222> not found for SEQ IDtf:3 
M:340 Repeated in SeqNo=3 

L:169 M:258 W: Mandatory Feature missing, <221> not found for SEQ ID#:3 
L:169 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID#:3 
L:367 M:258 W: Mandatory Feature missing, <220> FEATURE : 
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